Sequences Classi cation by Least General Generalisations
نویسندگان
چکیده
In this paper, we present a general framework for supervised classi cation. This framework provides methods like boosting and only needs the de nition of a generalisation operator called lgg. For sequence classi cation tasks, lgg is a learner that only uses positive examples. We show that grammatical inference has already de ned such learners for automata classes like reversible automata or k-TSS automata. Then we propose a generalisation algorithm for the class of balls of words. Finally, we show through experiments that our method e ciently resolves sequence classi cation tasks.
منابع مشابه
Convex Hulls in Concept Induction
Classi cation learning is dominated by systems which induce large num bers of small axis orthogonal decision surfaces This strongly biases such systems towards particular hypothesis types but there is reason believe that many domains have underlying concepts which do not involve axis orthog onal surfaces Further the multiplicity of small decision regions mitigates against any holistic appreciat...
متن کاملSequences Classification by Least General Generalisations
HAL is a multi-disciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers. L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau...
متن کاملDNA Sequence Analysis Using Hierarchical ART-based Classification Network
Adaptive resonance theory ART describes a class of arti cial neural net work architectures that act as classi cation tools which self organize work in real time and require no retraining to classify novel sequences We have adapted ART networks to provide support to scientists attempting to catego rize tandem repeat DNA fragments from Onchocerca volvulus In this approach sequences of DNA fragmen...
متن کاملCloud Classi cation Using Error-Correcting Output Codes
Novel arti cial intelligence methods are used to classify 16x16 pixel regions (obtained from Advanced Very High Resolution Radiometer (AVHRR) images) in terms of cloud type (e.g., stratus, cumulus, etc.). We previously reported that intelligent feature selection methods, combined with nearest neighbor classi ers, can dramatically improve classi cation accuracy on this task. Our subsequent analy...
متن کاملClassi cation of Membrane Proteins by Types of Transmembrane Helices Using SOSUI System
There are many amino acid sequences in proteomes which are not homologous to any other sequences. Therefore, methods to classify proteins independent of the sequence homology are strongly required for computational analysis of proteomes. Proteins may be divided into two categories: soluble and membrane proteins. Since membrane proteins are characterized by the existence of long hydrophobic tran...
متن کامل